Adjustment for the Maximum Likelihood Estimate of the Negative Binomial Dispersion Parameter
نویسندگان
چکیده
Negative Binomial (or Poisson-gamma) model has been used extensively by highway safety analysts because it can accommodate the over-dispersion, often exhibited in crash data. However, it has been reported in the literature that the maximum likelihood estimate of the dispersion parameter of NB models can be significantly affected when the data are characterized by small sample size and low sample mean. Given the important roles of the dispersion parameter in various types of highway safety analyses, there is a need to determine whether the bias could be potentially corrected or minimized. The objectives of this paper are to explore whether a systematic relationship exists between the estimated and true dispersion parameters; determine the bias as a function of the sample size and sample mean; and, develop a procedure for correcting the bias caused by these two conditions. For this purpose, simulated data were used to derive the relationship under the various combinations of sample mean, dispersion parameter and sample size, which encompass all simulation conditions performed in previous researches. The dispersion parameter was estimated using the maximum likelihood method. The results confirmed previous studies, and developed a reasonable relationship between the estimated and true dispersion parameter for reducing the bias. Details about the application of the correction procedure were also provided using the crash data collected at 458 three-legged unsignalized intersections in California. Finally, the study provided several discussion points for further work.
منابع مشابه
Estimating the Dispersion Parameter of the Negative Binomial Distribution for Analyzing Crash Data Using a Bootstrapped Maximum Likelihood Method
The objective of this study is to improve the estimation of the dispersion parameter of the negative binomial distribution for modeling motor vehicle collisions. The negative binomial distribution is widely used to model count data such as traffic crash data, which often exhibit low sample mean values and small sample sizes. Under such situations, the most commonly used methods for estimating t...
متن کاملHurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose Hurdle regression models for analysing count responses with extra zeros. A method of estimating maximum likelihood is used to estimate model parameters. The application of the proposed model is presented in insurance dataset. In this example, there are many numbers of claims equal to zero is considered that clarify the application of the model with a zero-inflat...
متن کاملSmall-sample estimation of negative binomial dispersion, with applications to SAGE data.
We derive a quantile-adjusted conditional maximum likelihood estimator for the dispersion parameter of the negative binomial distribution and compare its performance, in terms of bias, to various other methods. Our estimation scheme outperforms all other methods in very small samples, typical of those from serial analysis of gene expression studies, the motivating data for this study. The impac...
متن کاملMaximum Likelihood Estimation of the Negative Binomial Dispersion Parameter for Highly Overdispersed Data, with Applications to Infectious Diseases
BACKGROUND The negative binomial distribution is used commonly throughout biology as a model for overdispersed count data, with attention focused on the negative binomial dispersion parameter, k. A substantial literature exists on the estimation of k, but most attention has focused on datasets that are not highly overdispersed (i.e., those with k>or=1), and the accuracy of confidence intervals ...
متن کاملEstimation of Count Data using Bivariate Negative Binomial Regression Models
Abstract Negative binomial regression model (NBR) is a popular approach for modeling overdispersed count data with covariates. Several parameterizations have been performed for NBR, and the two well-known models, negative binomial-1 regression model (NBR-1) and negative binomial-2 regression model (NBR-2), have been applied. Another parameterization of NBR is negative binomial-P regression mode...
متن کامل